Goto

Collaborating Authors

 isp pipeline



AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection

Neural Information Processing Systems

Image Signal Processors (ISPs) convert raw sensor signals into digital images, which significantly influence the image quality and the performance of downstream computer vision tasks. Designing ISP pipeline and tuning ISP parameters are two key steps for building an imaging and vision system. To find optimal ISP configurations, recent works use deep neural networks as a proxy to search for ISP parameters or ISP pipelines. However, these methods are primarily designed to maximize the image quality, which are sub-optimal in the performance of high-level computer vision tasks such as detection, recognition, and tracking. Moreover, after training, the learned ISP pipelines are mostly fixed at the inference time, whose performance degrades in dynamic scenes. To jointly optimize ISP structures and parameters, we propose AdaptiveISP, a task-driven and scene-adaptive ISP. One key observation is that for the majority of input images, only a few processing modules are needed to improve the performance of downstream recognition tasks, and only a few inputs require more processing. Based on this, AdaptiveISP utilizes deep reinforcement learning to automatically generate an optimal ISP pipeline and the associated ISP parameters to maximize the detection performance.



AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection

Neural Information Processing Systems

Image Signal Processors (ISPs) convert raw sensor signals into digital images, which significantly influence the image quality and the performance of downstream computer vision tasks. Designing ISP pipeline and tuning ISP parameters are two key steps for building an imaging and vision system. To find optimal ISP configurations, recent works use deep neural networks as a proxy to search for ISP parameters or ISP pipelines. However, these methods are primarily designed to maximize the image quality, which are sub-optimal in the performance of high-level computer vision tasks such as detection, recognition, and tracking. Moreover, after training, the learned ISP pipelines are mostly fixed at the inference time, whose performance degrades in dynamic scenes. To jointly optimize ISP structures and parameters, we propose AdaptiveISP, a task-driven and scene-adaptive ISP.


ISP meets Deep Learning: A Survey on Deep Learning Methods for Image Signal Processing

da Silva, Matheus Henrique Marques, da Silva, Jhessica Victoria Santos, Arrais, Rodrigo Reis, Neto, Wladimir Barroso Guedes de Araújo, Lopes, Leonardo Tadeu, Bileki, Guilherme Augusto, Lima, Iago Oliveira, Rondon, Lucas Borges, de Souza, Bruno Melo, Regazio, Mayara Costa, Dalapicola, Rodolfo Coelho, Santos, Claudio Filipi Gonçalves dos

arXiv.org Artificial Intelligence

The Image Signal Processor (ISP) is a component of digital cameras capable of performing various tasks to improve image quality, as demosaicing, denoising, and white balance. The set of tasks performed by the ISP is called ISP pipeline, divided in preproccessing and postprocessing steps, and may differ from manufacturer to manufacturer [1]. Nowadays, Machine Learning is used to replace partially or the entire ISP pipeline. Particulary, Deep Learning is employed to replace ISP tasks, working on noise removal or some image feaure that hinders processing over the network. Deep Learning network provides an improvement in relation to computational efficiency and processing time. This survey paper aims to analyze recent studies, 27 research papers, that implemented Deep Learning based ISP pipeline.


DRL-ISP: Multi-Objective Camera ISP with Deep Reinforcement Learning

Shin, Ukcheol, Lee, Kyunghyun, Kweon, In So

arXiv.org Artificial Intelligence

In this paper, we propose a multi-objective camera ISP framework that utilizes Deep Reinforcement Learning (DRL) and camera ISP toolbox that consist of network-based and conventional ISP tools. The proposed DRL-based camera ISP framework iteratively selects a proper tool from the toolbox and applies it to the image to maximize a given vision task-specific reward function. For this purpose, we implement total 51 ISP tools that include exposure correction, color-and-tone correction, white balance, sharpening, denoising, and the others. We also propose an efficient DRL network architecture that can extract the various aspects of an image and make a rigid mapping relationship between images and a large number of actions. Our proposed DRL-based ISP framework effectively improves the image quality according to each vision task such as RAW-to-RGB image restoration, 2D object detection, and monocular depth estimation.